智能论文笔记

Real-time Virtual Intraoperative CT for Image Guided Surgery

Yangming Li , Neeraja Konuthula , Ian M. Humphreys , Kris Moe , Blake Hannaford , Randall Bly

分类：计算机视觉 | 机器学习 | 机器人

2021-12-05

抽象的。目的：本文提出了一种用于产生虚拟术中CT扫描的方案，以改善内窥镜窦手术（ESS）的手术完整性。方法：该工作呈现三种方法，基于尖端运动，基于尖端轨迹的基于仪器，以及基于仪器，以及虚拟术中CT生成的非参数平滑和高斯过程回归。结果：所提出的方法研究，并在尸体上进行的ESS进行了比较。外科结果表明，所有三种方法都改善了骰子相似系数> 86％，F分数> 92％和精度> 89.91％。发现基于尖端轨迹的方法具有最佳性能，并在外科完整性评估中获得了96.87％的精度。结论：这项工作表明，虚拟术中CT扫描改善了实际手术场景与参考模型之间的一致性，并提高了ESS中的手术完整性。与实际的术中CT扫描相比，该方案对现有的外科议定书没有影响，不需要除了最多的ESS中已经提供的额外硬件克服了高成本，重复辐射和由实际术中引起的细长麻醉CTS，并在ESS中实用。

translated by 谷歌翻译

Real-time Informative Surgical Skill Assessment with Gaussian Process Learning

Yangming Li , Randall Bly , Sarah Akkina , Rajeev C. Saxena , Ian Humphreys , Mark Whipple , Kris Moe , Blake Hannaford

分类：机器学习

2021-12-05

内镜窦和头骨基础手术（Essbss）是一个具有挑战性和潜在的危险的外科手术，客观技能评估是提高手术训练有效性的关键组成部分，重新验证外科医生的技能，并降低手术创伤和并发症手术室的速度。由于外科手术的复杂性，操作风格的变化，以及新的外科技能的快速发展，外科技能评估仍然是一个具有挑战性的问题。这项工作提出了一种新颖的高斯过程学习的启发式自动客观外科手术技能评估方法。不同于经典的外科技能评估算法，所提出的方法1）利用外科仪器相对运动中的运动学特征，而不是使用特定的外科任务或统计数据实时评估技能; 2）提供信息丰富的反馈，而不是总结分数; 3）能够逐步从新数据逐步学习，而不是根据固定的数据集。该方法将仪器运动投射到内窥镜坐标中以减少数据维度。然后，它提取投影数据的运动学特征，并学习外科技能水平与高斯过程学习技术的特征之间的关系。该方法在全内镜颅底和尸体上的鼻窦手术中核实。这些手术具有不同的病理学，需要不同的治疗并具有不同的复杂性。实验结果表明，该方法达到了100 \％的预测精度，用于完整的外科手术和90 \％的实时预测评估精度。

translated by 谷歌翻译

Deep Active Learning Using Barlow Twins

Jaya Krishna Mandivarapu , Blake Camp , Rolando Estrada

分类：计算机视觉 | 人工智能

2022-12-30

The generalisation performance of a convolutional neural networks (CNN) is majorly predisposed by the quantity, quality, and diversity of the training images. All the training data needs to be annotated in-hand before, in many real-world applications data is easy to acquire but expensive and time-consuming to label. The goal of the Active learning for the task is to draw most informative samples from the unlabeled pool which can used for training after annotation. With total different objective, self-supervised learning which have been gaining meteoric popularity by closing the gap in performance with supervised methods on large computer vision benchmarks. self-supervised learning (SSL) these days have shown to produce low-level representations that are invariant to distortions of the input sample and can encode invariance to artificially created distortions, e.g. rotation, solarization, cropping etc. self-supervised learning (SSL) approaches rely on simpler and more scalable frameworks for learning. In this paper, we unify these two families of approaches from the angle of active learning using self-supervised learning mainfold and propose Deep Active Learning using BarlowTwins(DALBT), an active learning method for all the datasets using combination of classifier trained along with self-supervised loss framework of Barlow Twins to a setting where the model can encode the invariance of artificially created distortions, e.g. rotation, solarization, cropping etc.

translated by 谷歌翻译

The Onset of Variance-Limited Behavior for Networks in the Lazy and Rich Regimes

Alexander Atanasov , Blake Bordelon , Sabarish Sainathan , Cengiz Pehlevan

分类： (统计)机器学习 | 机器学习

2022-12-23

For small training set sizes $P$, the generalization error of wide neural networks is well-approximated by the error of an infinite width neural network (NN), either in the kernel or mean-field/feature-learning regime. However, after a critical sample size $P^*$, we empirically find the finite-width network generalization becomes worse than that of the infinite width network. In this work, we empirically study the transition from infinite-width behavior to this variance limited regime as a function of sample size $P$ and network width $N$. We find that finite-size effects can become relevant for very small dataset sizes on the order of $P^* \sim \sqrt{N}$ for polynomial regression with ReLU networks. We discuss the source of these effects using an argument based on the variance of the NN's final neural tangent kernel (NTK). This transition can be pushed to larger $P$ by enhancing feature learning or by ensemble averaging the networks. We find that the learning curve for regression with the final NTK is an accurate approximation of the NN learning curve. Using this, we provide a toy model which also exhibits $P^* \sim \sqrt{N}$ scaling and has $P$-dependent benefits from feature learning.

translated by 谷歌翻译

HACA3: A Unified Approach for Multi-site MR Image Harmonization

Lianrui Zuo , Yihao Liu , Yuan Xue , Blake E. Dewey , Murat Bilgel , Ellen M. Mowry , Scott D. Newsome , Peter A. Calabresi , Susan M. Resnick , Jerry L. Prince

分类：计算机视觉

2022-12-12

The lack of standardization is a prominent issue in magnetic resonance (MR) imaging. This often causes undesired contrast variations due to differences in hardware and acquisition parameters. In recent years, MR harmonization using image synthesis with disentanglement has been proposed to compensate for the undesired contrast variations. Despite the success of existing methods, we argue that three major improvements can be made. First, most existing methods are built upon the assumption that multi-contrast MR images of the same subject share the same anatomy. This assumption is questionable since different MR contrasts are specialized to highlight different anatomical features. Second, these methods often require a fixed set of MR contrasts for training (e.g., both Tw-weighted and T2-weighted images must be available), which limits their applicability. Third, existing methods generally are sensitive to imaging artifacts. In this paper, we present a novel approach, Harmonization with Attention-based Contrast, Anatomy, and Artifact Awareness (HACA3), to address these three issues. We first propose an anatomy fusion module that enables HACA3 to respect the anatomical differences between MR contrasts. HACA3 is also robust to imaging artifacts and can be trained and applied to any set of MR contrasts. Experiments show that HACA3 achieves state-of-the-art performance under multiple image quality metrics. We also demonstrate the applicability of HACA3 on downstream tasks with diverse MR datasets acquired from 21 sites with different field strengths, scanner platforms, and acquisition protocols.

translated by 谷歌翻译

Transfer Entropy Bottleneck: Learning Sequence to Sequence Information Transfer

Damjan Kalajdzievski , Ximeng Mao , Pascal Fortier-Poisson , Guillaume Lajoie , Blake Richards

分类：机器学习

2022-11-29

When presented with a data stream of two statistically dependent variables, predicting the future of one of the variables (the target stream) can benefit from information about both its history and the history of the other variable (the source stream). For example, fluctuations in temperature at a weather station can be predicted using both temperatures and barometric readings. However, a challenge when modelling such data is that it is easy for a neural network to rely on the greatest joint correlations within the target stream, which may ignore a crucial but small information transfer from the source to the target stream. As well, there are often situations where the target stream may have previously been modelled independently and it would be useful to use that model to inform a new joint model. Here, we develop an information bottleneck approach for conditional learning on two dependent streams of data. Our method, which we call Transfer Entropy Bottleneck (TEB), allows one to learn a model that bottlenecks the directed information transferred from the source variable to the target variable, while quantifying this information transfer within the model. As such, TEB provides a useful new information bottleneck approach for modelling two statistically dependent streams of data in order to make predictions about one of them.

translated by 谷歌翻译

Contrastive introspection (ConSpec) to rapidly identify invariant prototypes for success in RL

Chen Sun , Wannan Yang , Benjamin Alsbury-Nealy , Yoshua Bengio , Blake Richards

分类：机器学习 | 人工智能

2022-10-12

Reinforcement learning (RL) algorithms have achieved notable success in recent years, but still struggle with fundamental issues in long-term credit assignment. It remains difficult to learn in situations where success is contingent upon multiple critical steps that are distant in time from each other and from a sparse reward; as is often the case in real life. Moreover, how RL algorithms assign credit in these difficult situations is typically not coded in a way that can rapidly generalize to new situations. Here, we present an approach using offline contrastive learning, which we call contrastive introspection (ConSpec), that can be added to any existing RL algorithm and addresses both issues. In ConSpec, a contrastive loss is used during offline replay to identify invariances among successful episodes. This takes advantage of the fact that it is easier to retrospectively identify the small set of steps that success is contingent upon than it is to prospectively predict reward at every step taken in the environment. ConSpec stores this knowledge in a collection of prototypes summarizing the intermediate states required for success. During training, arrival at any state that matches these prototypes generates an intrinsic reward that is added to any external rewards. As well, the reward shaping provided by ConSpec can be made to preserve the optimal policy of the underlying RL agent. The prototypes in ConSpec provide two key benefits for credit assignment: (1) They enable rapid identification of all the critical states. (2) They do so in a readily interpretable manner, enabling out of distribution generalization when sensory features are altered. In summary, ConSpec is a modular system that can be added to any existing RL algorithm to improve its long-term credit assignment.

translated by 谷歌翻译

Detect, Retrieve, Comprehend: A Flexible Framework for Zero-Shot Document-Level Question Answering

Tavish McDonald , Brian Tsan , Amar Saini , Juanita Ordonez , Luis Gutierrez , Phan Nguyen , Blake Mason , Brenda Ng

分类：自然语言处理 | 人工智能 | 机器学习

2022-10-04

Researchers produce thousands of scholarly documents containing valuable technical knowledge. The community faces the laborious task of reading these documents to identify, extract, and synthesize information. To automate information gathering, document-level question answering (QA) offers a flexible framework where human-posed questions can be adapted to extract diverse knowledge. Finetuning QA systems requires access to labeled data (tuples of context, question and answer). However, data curation for document QA is uniquely challenging because the context (i.e. answer evidence passage) needs to be retrieved from potentially long, ill-formatted documents. Existing QA datasets sidestep this challenge by providing short, well-defined contexts that are unrealistic in real-world applications. We present a three-stage document QA approach: (1) text extraction from PDF; (2) evidence retrieval from extracted texts to form well-posed contexts; (3) QA to extract knowledge from contexts to return high-quality answers -- extractive, abstractive, or Boolean. Using QASPER for evaluation, our detect-retrieve-comprehend (DRC) system achieves a +7.19 improvement in Answer-F1 over existing baselines while delivering superior context selection. Our results demonstrate that DRC holds tremendous promise as a flexible framework for practical scientific document QA.

translated by 谷歌翻译

DEQGAN: Learning the Loss Function for PINNs with Generative Adversarial Networks

Blake Bullwinkel , Dylan Randle , Pavlos Protopapas , David Sondak

分类：机器学习

2022-09-15

微分方程的解决方案具有重要的科学和工程意义。物理知识的神经网络（PINN）已成为解决微分方程的有前途方法，但它们缺乏使用任何特定损失函数的理论理由。这项工作提出了微分方程gan（DEQGAN），这是一种使用生成对抗网络来求解微分方程的新方法，以“学习损失函数”以优化神经网络。在十二个普通和部分微分方程的套件上呈现结果，包括非线性汉堡，艾伦·卡恩，汉密尔顿和改良的爱因斯坦的重力方程，我们表明deqgan可以比使用$ pinn的均方一数级别的均方一数级别。 L_2 $，$ L_1 $和HUBER损失功能。我们还表明，Deqgan可以实现与流行数值方法竞争的解决方案精确度。最后，我们提出了两种方法，以提高Deqgan对不同的高参数设置的鲁棒性。

translated by 谷歌翻译

Ontologizing Health Systems Data at Scale: Making Translational Discovery a Reality

Tiffany J. Callahan , Adrianne L. Stefanski , Jordan M. Wyrwa , Chenjie Zeng , Anna Ostropolets , Juan M. Banda , William A. Baumgartner Jr. , Richard D. Boyce , Elena Casiraghi , Ben D. Coleman

分类：人工智能

2022-09-10

通用数据模型解决了标准化电子健康记录（EHR）数据的许多挑战，但无法将其集成深度表型所需的资源。开放的生物学和生物医学本体论（OBO）铸造本体论提供了可用于生物学知识的语义计算表示，并能够整合多种生物医学数据。但是，将EHR数据映射到OBO Foundry本体论需要大量的手动策展和域专业知识。我们介绍了一个框架，用于将观察性医学成果合作伙伴关系（OMOP）标准词汇介绍给OBO铸造本体。使用此框架，我们制作了92,367条条件，8,615种药物成分和10,673个测量结果的映射。域专家验证了映射准确性，并且在24家医院进行检查时，映射覆盖了99％的条件和药物成分和68％的测量结果。最后，我们证明OMOP2OBO映射可以帮助系统地识别可能受益于基因检测的未诊断罕见病患者。

translated by 谷歌翻译